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We extend our density matrix embedding theory (DMET) [Phys. Rev. Lett. 109 186404 (2012)] from lattice models to 
the full chemical Hamiltonian. DMET allows the many-body embedding of arbitrary fragments of a quantum system, 
even when such fragments are open systems and strongly coupled to their environment (e.g., by covalent bonds). In 
DMET, empirical approaches to strong coupling, such as link atoms or boundary regions, are replaced by a small, 
rigorous quantum bath designed to reproduce the entanglement between a fragment and its environment. We describe 
the theory and demonstrate its feasibility in strongly correlated hydrogen ring and grid models; these are not only 
beyond the scope of traditional embeddings, but even challenge conventional quantum chemistry methods themselves. 
We find that DMET correctly describes the notoriously difficult symmetric dissociation of a 4x3 hydrogen atom grid, 
even when the treated fragments are as small as single hydrogen atoms. We expect that DMET will open up new ways 
of treating of complex strongly coupled, strongly correlated systems in terms of their individual fragments. 



Embedding techniques are powerful tools for enabling 
high-level many-body treatments on system sizes they can- 
not normally reach. They work by dividing a chemical sys- 
tem into fragments, each of which is handled individually; 
the interaction with the other fragments — the environment — 
is treated in a simplified way. In this communication we are 
concerned with embeddings for fragments which are strongly 
coupled to their environment, for example via covalent bonds. 

A particular embedding is characterized by the precise 
manner in which the environment, and its influence on the 
fragment, are represented. To date, most techniques represent 
the environment through a one-particle embedding potential 
V. For example, in QM/MM methods, v is obtained through 
electrostatics or polarization interactions,^ ^ while in ab-initio 
DPT embedding, v is the derivative of the non-additive energy 
functional.-'' * However, an embedding potential, regardless of 
how it is obtained, cannot represent the effect of the environ- 
ment on the many-body fragment state when the coupling is 
strong: This is illustrated by the simple example of embed- 
ding a hydrogen atom A within a hydrogen molecule AB. If 
hydrogen atom B is represented by an embedding potential v, 
then hydrogen atom A appears as a closed system with a single 
electron; thus, any wavefunction description of the fragment, 
regardless of the choice of v, provides no information on elec- 
tron correlation. 

This failure of potential based embedding is rooted in the 
fact that the fragments are open systems that are entangled 
with their environment. Is it possible even in principle to for- 
mulate an embedding description of an open fragment? This 
question has far reaching consequences; an affirmative answer 
would imply, for example, that one could in principle exactly 
calculate the properties of a bulk diamond crystal by treating 
a single embedded carbon atom. Here we argue that this is in 
fact the case. The key is to represent the environment not by 
a potential, but rather through a quantum bath that reproduces 
the entanglement of the embedded fragment with the full en- 
vironment. In existing embedding approaches, empirical link 
atoms or boundary regional can be thought of as baths, but 
here we show that an exact bath, that exactly reproduces all 
many-body environment effects, can in fact be defined. In re- 
alistic systems, the construction of this exact bath is not prac- 
tical. However, as we show below, this point of view naturally 



leads to a practical embedding method which we call density 
matrix embedding theory (DMET). 

We have previously introduced DMET in the context of 
fermionic lattice modelJ^ where it showed excellent perfor- 
mance; in particular, in comparison to the more complex dy- 
namical mean-field theory (DMET). Here we describe the ex- 
tension and modification of DMET to treat inhomogeneous 
systems and the full chemical Hamiltonian with long-range 
interactions, namely 

H = Y^h!x+\YyX- (1) 
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We note especially that DMET is defined through simple lin- 
ear algebra and avoids the numerical issues of inverse prob- 
lems associated with constructing potential embeddings, as 
found in ab-initio DFT embedding (see also Ref. [TO] another 
solution to the inverse problem). 

We use the following notation: The full quantum system 
Q is spanned by an orthogonal one-particle basis (e.g., an 
symmetrically orthogonalized atomic orbital basis). The or- 
thogonal basis functions are indexed by r,s,t,u (e.g., Eq. ([T]i) 
and referred to as sites. Q is divided into sets of sites called 
fragments, such that each site occurs in exactly one fragment. 
Fragments are handled sequentially, and for each fragment A, 
the union of the other fragments is treated as environment B. 
\A\ is the number of sites in set A. 

First, let us review the the exact formal bath that exactly 
embeds a given fragment A. Let Q be the full quantum system, 
and 1^) be an eigenstate of Q. |^) may expanded as 

\^) =Y^,^mM)\^j) (2) 

where ja,) and |)3,) are states in the Fock space spanned by 
the fragment A and the environment B, respectively. Simple 
algebra shows that 

1^) = Wij !«,■> I^;> = E, !«,■> (E,. w.j IPj) ) 
= E>') 1^') =^ Hi,' ^'i' 1^'-') (3) 

where \Xi) — Y,j Wij \Xi) is the orthogonalized set of 

1^,) states. (This rewriting is closely related to the Schmidt 
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decomposition of quantum information theory^. Note that 
in the last line, although \Xii) are states in the environment B, 
there are only of them: the dimension of the Fock space of 
A. This set of special environment states defines the bath. We 
see that (i) no matter how large the envkonment, in a given 
state 1^), a fragment A can only be entangled with Ma envi- 
ronment states in B. Thus the entanglement effect of the envi- 
ronment fully represented by a bath of the same size as the 
fragment it is embedding. This combination of the fragment 
with its bath, we refer to as the embedded system, (ii) If |^) is 
an eigenfunction of a Hamiltonian H (that acts in the full sys- 
tem Q), it is also an eigenfunction of a projected Hamiltonian 
defined in only the embedded system, H' = PHP, where 



(4) 



H' is the embedding Hamiltonian. 

We thus have established that fragments can, in principle, 
always be exactly embedded by baths no larger than the frag- 
ments themselves. While exact, this result is purely formal, 
because to construct the bath states we require knowl- 

edge of the state |^) of the full system Q. However, it nat- 
urally suggests a practical approximation: construct the bath 
from an approximate state of Q, |<I>), and use this approxi- 
mate bath in a subsequent high-level treatment of the embed- 
ded fragments. This is the combination we aim for in DMET. 

A simple choice for |<I>) is a Slater determinant (for ex- 
ample, as obtained from a mean-field treatment of the full 
system). For a Slater determinant, the associated bath and 
embedding Hamiltonian are particularly simple: they can be 
obtained from single-particle linear-algebra rather than the 
many-particle decomposition in Eq. (|3]l, as we now show. 
First note that, for any |^), the fragment many -body states 
I a, ) in Eq. ([3]) live in the Fock space spanned by the one- 
particle fragment sites ^{\i)), i G A. In the special case where 
1^) — |<I>) is a determinant, also the bath states \Xi) live in a 
Fock space defined by one-particle bath orbitals {\b)} (at most 
|A|), multiplied by a common core determinant. This is seen 
as follows. Let \p) = Y.rC], \r) denote the occupied orbitals 
of |<I>), where p = \...N,r ^Q. Let 

Sp, = Y.ieA^P\i)m (5) 

define the overlap matrix S of the orbitals projected onto the 
sites of fragment A. Then the eigenvectors of S define a rota- 
tion of the occupied orbitals \p) \p) which divides them 
into two sets: a set of A^ — |A| occupied orbitals with zero 
eigenvalues, and thus without any component on the fragment 
sites, and a set of |A| occupied orbitals with non-zero eigen- 
values, which have overlap with the fragment sites. We call 
the former "pure environment orbitals", and the latter "entan- 
gled orbitals". Projecting the entangled orbitals p = 1 . . . |A| 
onto the environment sites B and normalizing then yields a set 
of bath orbitals \b) of the same number as fragment sites. 
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Rewriting |0) in terms of the rotated orbitals \p), and express- 
ing each 1/5) in terms of its fragment, bath, and pure environ- 
ment components, we see that the many-body bath states \Xi) 



span the same space as ,^{\b)) ® dst{e\e2 ■ ■ ■e;^_\A^, where 
det(eie2 ■ ■ ■eN-\A\) is the determinant of pure environment or- 
bitals. In other words, when split across a fragment and envi- 
ronment, a determinant |<I>) appears as a CAS-CI (complete 
active space configuration interaction) expansion in a half- 
filled active embedding basis of fragment plus bath orbitals, 
with a core determinant of pure environment 

orbitals \e). 

We next construct the embedding Hamiltonian correspond- 
ing to the Slater determinant |<I>). Formally, this is defined 
from the many-body projection Eq. Q, but for the case of a 
Slater determinant, it can be constructed by a simple change 
of single-particle basis. We define the embedded Hamilto- 
nian H' by projecting H into the space ,j^{\i)) ® ^{\b)) ® 
det(eie2 ■ ■ ■^n-\a\)- This is equivalent to transforming H into 
the active space of the embedding basis (fragment plus bath 
orbitals), and including a core contribution from the pure en- 
vironment determinant, det(eie2 ■ ■ ■eN-\A\)- Denoting the em- 
bedding basis by labels v, w, x, y, and its representation in 
terms of the full system sites by \x) = |r), we find 
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If the full system |<I>) was obtained from a Hartree-Fock cal- 
culation, then carrying out a Hartree-Fock calculation in this 
embedded system with the embedding Hamiltonian H' yields 
a mean-field Fock operator /' for which |<I>) is an eigenstate. 

In Fig. [T] we numerically demonstrate the exactness of the 
above mean-field embedding by the following process: (i) a 
Hartree-Fock calculation is performed on a molecule, (ii) the 
molecule is split into arbitrary groups of atoms as fragments. 
For each fragment, an embedding is constructed (bath orbitals 
and H') and a Hartree-Fock calculation is run on the embed- 
ded system, (iii) the system is reassembled by adding up the 
energies and electrons located on the various fragments. We 
note that only the electrons and energy contributions associ- 
ated with the fragment sites should be considered when re- 
assembling the system, not the contributions associated ex- 
clusively with bath or pure environment orbitals, as this would 
lead to double counting. To that end, we define the energy of 
a fragment A using the one- and two-particle density matrices, 
7, F, with at least one index in fragment A, 



ieA.s ieA.stu 
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Note that 7 and F include the contributions from the pure en- 
vironment orbitals (the core determinant). As we see from 
Fig.[T] the embedding allows an arbitrary fragmentation into 
open fragments to be exactly reassembled (note the fractional 
electron numbers!), recovering the exact Hartree-Fock energy 
of the wave function |<I>) used to contmct it. 

We now return to the DMET. Recall that here we still con- 
struct the embedding based on a mean-field |<t>) of the full 
system Q, but use it to embed high level, calculations on the 
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-38.03578679 
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Full system: 

DF-RHF Energy: 
Electrons: 



-605.99038245 
100 



Embedded fragments: 

Sum of energies: -605.99038245 
Sum of electrons: 100.000000 



Details for fragment 6: 
1 6 orbitals: 8 fragment, 8 bath 
1 6 electrons: 8.862371 on fragment sites 
7.137629 on bath orbitals 



FIG. 1 . Example for HF in HF embedding: A molecule is split into random atomic fragments. For each fragment, an embedding is constructed, 
and the embedded system is treated with Hartree-Fock. The left panel shows the obtained distribution of energy and electrons amongst the 
fragments. The right inset shows how the open system embedding (e.g., a fractional electron number) is realized by coupling to a bath. 



fragments, rather than Hartree-Fock. For each fragment A, 
the high level calculation on the embedded system (fragment 
sites, bath orbitals, and Hamiltonian H') yields a correlated 
\^a)- Here we require an additional self-consistency cycle to 
ensure a consistent fragment description by the mean-field |<I>) 
and the high-level embedded |^a)- Although the choice of 
self-consistency condition is not unique, we note that a mean- 
field state is characterized by its one-particle density matrix 
(<I>|fljtflj|<I>), and thus it is convenient to enforce consistency 
at the level of the density matrices by minimizing the metric 

A = L L ll(*|flkl*>-(^A|flk-I^A)||'. (11) 

A rs£A 

Here we define the density matrix difference only over (in- 
tra)fragment sites for simplicity, but very similar results are 
obtained by defining the difference over fragment plus bath 
sites for each embedding, as we did in our earlier work.^ |<I>) 
is then obtained from a mean-field Fock operator / augmented 
by a set of one-particle operators for each fragment, with 
= for r ^ A or s ^ A. The ua are chosen to minimize A 
and capture the correlation effects on the one-particle density 
matrix. The embedded Hamiltonian is also augmented with 
the correlation operators on the fragments other than the one 
currently being considered, projected into the embedding ba- 
sis. The DMET self-consistency cycle is thus: 

1. The full system is treated at the Hartree-Fock level us- 
ing the Fock operator / + Y.A ua to determine |<I>). Ini- 
tially all UA are zero. 

2. For each fragment, |<I>) is used to construct an embed- 
ding basis. The embedding Hamiltonian for fragment 
A, H'j^, is obtained by projecting H + Y.a'^a "a' into the 
embedding basis following Eq. (|7]i. The embedded frag- 
ment's state Y^a) is calculated at a correlated level, for 
example, with full configuration interaction (FCI). 

3. For each fragment A, we adjust the correlation op- 
erator Ua — Y^rseA^^s^r minimize the difference 
between the Hartree-Fock one-particle density matrix 
and the correlated one-particle density matrix, A^ = 

4. The cycle is iterated until all converge. 
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FIG. 2. Symmetric stretching of a Hio ring: Shown are the exact 
results (FCI), the mean-field results (RHF), and the 1-site DMET 
results. In 1-site DMET, the system is fragmented into individual 
atoms treated with FCI, and embeddings are constructed from full 
system RHF. 



In order to test whether the DMET provides reasonable 
results, we now consider simple model systems exhibiting 
strong correlation; namely, hydrogen rings, chains, and grids. 
Such systems have recently emerged as a rich testbed for new 
correlation methods, as the strength of the correlation can be 
readily tuned from weak to strong by changing the hydrogen 
atom spacing.n^ElNote that such systems show high degener- 
acy and are difficult to handle even with full correlation treat- 
ments. Furthermore, as we argued in the introduction, any 
choice of fragment will be strongly coupled to the rest of the 
system, so potential based embeddings cannot describe them. 
All calculations employ a minimal hydrogen basis consisting 
of orthogonalized Ls-like AO functions obtained from an un- 
derlying cc-pVTZ basis, except for the H50 calculation, for 
which we use a ST0-6G basis to retain compatibility with ear- 
lier references. For the high level treatment of the embedded 
systems, we use FCI. All calculations are spin-adapted, and 
both the mean-field (restricted HF) and correlated (FCI) calcu- 
lations use singlet wave functions. As DMET self consistency 
metric we used Eq. ( [TT| , except for the 3-site calculation on 
the 4x3 grid, where the previous metrid^l was used to avoid 
convergence problems at r > 3.8flbohr- 

As the simplest non-trivial example, we investigate the 
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FIG. 3. Symmetric stretching of a H^o ciiain: Shown are reference 
DMRG 2" and CCSD(T) data, the mean-field results (RHF), the 1-site 
DMET results, and the DMFT results by Lin et al.''' 
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FIG. 4. Symmetric stretching of a 4 x 3 hydrogen grid. In the 3-site 
DMET, the system is fragmented into four columns of three hydro- 
gens each. 



symmetric dissociation of a ring of ten hydrogen atoms (that 
is, all bonds are simultaneously stretched). The system is frag- 
mented into individual Is orbitals of the hydrogen atoms, and 
thus each site is one fragment. The results are shown in Fig. [2] 
The 1-site DMET calculation almost exactly reproduces the 
reference FCI curve. Note that each correlated fragment cal- 
culation corresponds to a FCI calculation on only two orbitals: 
the H Is orbital, and a single bath orbital, and is thus a trivial 
3x3 matrix diagonalization.GS 

As second example, we choose an inhomogeneous system: 
the linear H50 chain. FCI on this system would require on 
the order of 10^^ determinants, so truly exact results cannot 
be calculated. However, near-exact reference data can be ob- 
tained by the quantum chemistry density matrix renormaliza- 
tion group (DMRG). We here take the data from Hachmann 
et alF^ This particular system has also been the subject of a 
recent DMFT study by Lin et al.,'^the results of which are 
shown for comparison. It is clear that the 1-site DMET once 
again closely reproduces the reference values. Again the frag- 
ment calculations are two orbital FCI and thus numerically 
trivial. It is noteworthy that in this case DMET does better 
than the more complex DMFT, likely due to its ability to treat 
long-range interactions with the bath beyond mean-field. 



For a more challenging system, we now turn to a two- 
dimensional inhomogeneous system, the 4x3 hydrogen 
grid. This system is very pathological — it is non-binding at 
Hartree-Fock level, and converging the normally very robust 
FCl^' required hundreds of iterations for some points. As 
shown in Fig.|4] even here the 1-site DMET qualitatively re- 
produces the FCI binding curve. The calculations remain as 
trivial to perform as for the one-dimensional systems. Addi- 
tionally, if we embed entire columns of atoms, carrying out a 
3-site DMET, the agreement between the embedded and the 
reference results becomes almost perfect. 

The accuracy of these results may seem surprising, given 
that the systems are strongly correlated but the embedding is 
obtained from an uncoiTelated, qualitatively incorrect, mean- 
field |<I>). However, it is only the bath states that are deter- 
mined from the mean-field theory. Once those states are de- 
termined, the embedded Hamiltonian is constructed exactly, 
and the coupling between the fragment sites and the bath or- 
bitals is obtained by FCI, not mean-field. This allows the frag- 
ment state to correctly transition from the delocalized regime 
of weak correlation to the entangled spin regime of strong cor- 
relation. As long as the entanglement is reasonably local and 
does not involve the pure environment orbitals, we can expect 
good results. 

The robustness and simplicity of DMET, even in the pres- 
ence of strong coupling and strong correlation, makes it 
unique amongst current embedding approaches, and suggest 
that it could be useful in a wide range of applications. The 
next step will be to apply the theory to more realistic and 
larger scale chemical problems. We are now pursuing these 
studies. 
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